Datalog Relaunched: Simulation Unification and Value Invention
نویسندگان
چکیده
For reasoning on the Web, Datalog is lacking data extraction and value invention. This article proposes to overcome these limitations with “simulation unification” and “RDFLog”. Simulation unification is a non-standard unification inspired from regular path queries. Like standard unification, it yields bindings for variables in both terms to unify. Unlike standard unification, it does not try to make the two terms identical but instead to embed the query into the data. Simulation unification is decidable. Without variables, it has polynomial complexity. With variables it is, like standard unification, np-complete. We identify a number of interesting special cases of unification, e.g., in presence or absence of term injectivity. In particular, we show that simulation unification without term injectivity on tree data is linear and in presence of injectivity it is still polynomial even on unordered trees in contrast to the np-complete unordered tree inclusion problem. RDFLog is Datalog with arbitrary quantifier alternation: Blank nodes, i.e., existentially quantified variables, in rule heads may be governed by universally quantified variables, universally quantified variables by blank nodes. RDFLog’s declarative semantics is defined in terms of RDF entailment; its sound and complete operational semantics, in terms of Skolemization, standard Datalog evaluation, and un-Skolemization. We show that RDFLog limited to ∀∗∃∗ prefixes is (up to unique helper predicates) equivalent to RDFLog with full quantifier alternation. A lightweight implementation points to the efficiency of the approach.
منابع مشابه
Expressiveness of Semipositive Logic Programs with Value Invention
We study the expressive power of the relational query language wILOG 1 2 ;: of semipositive datalog programs extended with a mechanism of safe value invention. We adopt a semantics for value invention based on the use of Skolem functor terms. We show that this language expresses exactly the class of semimonotone queries, that is, the class of computable queries that are preserved under extensions.
متن کاملComputing with Features as Formulae
This paper extends the approach to feature structures developed in Johnson (1991a), which uses SchO'nfinkel-Bernays' formulae to express feature structure constraints. These are shown to be a disjunctive generalization of Datalog clauses, as used in database theory. This paper provides a fixed-point characterization of the minimal models of these formulae that serves as the theoretical foundati...
متن کاملResolution and Datalog Rewriting Under Value Invention and Equality Constraints
While Datalog is a golden standard for denotational query answering, it does not support value invention or equality constraints. The Datalog framework introduced by Gottlob faces these issues by considering rules with fresh variables in the head (known as tgds) or equalities in the head (known as egds). Several tractable classes have been identified, among which: (S) the class of sticky tgds; ...
متن کاملSynthesis of the Canonical Models for Database Integration Preserving Semantics of the Value Inventive Data Models
Recently families of sets of dependencies treated as the Datalog extensions were discovered for which the interpretation of queries becomes tractable. Such families are intended for inference of new, unknown values in the process of query answering. This paper considers such decidable classes of dependencies as the assets for creation of new data models (called in the paper the value inventive ...
متن کاملCoupling Constant Unification and LEP Data
The recent LEP data for gauge coupling constants constrain many grand unified models. In this paper, we study several possibilities for unification of gauge coupling constants. Without an intermediate scale, the minimal supersymmetric standard model with two Higgs doublets is the only possibility. For one intermediate scale, we present a few unification schemes without supersymmetry. One of the...
متن کامل